Automation of Indian Postal Documents Written in Bangla and English
Abstract
In this paper, we present a system towards Indian postal automation based on pin-code and city name recognition. First, non-text blocks (postal stamp, postal seal, etc.) are detected using the Run Length Smoothing Approach (RLSA), and the Destination Address Block (DAB) is identified from the postal document using positional information. Next, the lines and words of the DAB are segmented. In India, the address part of a postal document may be written in a combination of two scripts: Latin (English) and a local (state/region) script. It is very difficult to identify the script in which the pin-code part is written. To overcome this problem, we use a general scheme based on a two-stage artificial neural network to recognize pin-code numerals written in either of the two scripts. To identify the script in which a word/city name is written, we propose a feature based on the water reservoir concept. For recognition of city names, we propose an NSHP-HMM (Non-Symmetric Half Plane-Hidden Markov Model) based technique. At present, the accuracy of the proposed numeral recognition module is 93.14%, while that of the city name recognition scheme is 86.44%.
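The run-length smoothing step mentioned in the abstract can be illustrated with a minimal sketch. This is a generic horizontal RLSA, not the authors' implementation: the function names, the 1 = ink / 0 = background convention, the threshold value, and the choice to fill only gaps bounded by ink on both sides are all assumptions made for illustration.

```python
def rlsa_row(row, threshold):
    """Horizontal run-length smoothing of one binary row.

    Assumed convention: 1 = ink, 0 = background. A background run is
    filled with ink when it lies between two ink pixels and its length
    is at most `threshold` (a hypothetical parameter).
    """
    out = list(row)
    last_ink = None  # index of the most recent ink pixel seen so far
    for i, pixel in enumerate(row):
        if pixel == 1:
            gap = i - last_ink - 1 if last_ink is not None else 0
            if 0 < gap <= threshold:
                # Short gap between two ink pixels: merge them.
                for j in range(last_ink + 1, i):
                    out[j] = 1
            last_ink = i
    return out


def rlsa_horizontal(image, threshold):
    """Apply rlsa_row to every row of a binary image (list of lists)."""
    return [rlsa_row(row, threshold) for row in image]


if __name__ == "__main__":
    row = [1, 0, 0, 1, 0, 0, 0, 0, 1, 0]
    print(rlsa_row(row, 2))
```

With a threshold of 2, the two-pixel gap in the sample row is filled while the four-pixel gap is left open, so nearby strokes merge into blocks and distant ones stay separate. Block-level properties of the merged regions could then be used to tell text from non-text, although the abstract does not say which properties the authors use.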
Similar resources
International Journal of Applied Science & Technology Research Excellence Vol. 1, Issue 1, Nov-Dec 2011, ISSN NO. 2250 – 2718 (Print), 2250 – 2726 (Online)
In this paper, we present a system towards Indian postal automation based on PIN (Postal Index Number) code. Since India is a multilingual and multi-script country that was earlier colonized by the UK, the address part may be written in a combination of scripts such as Latin (English) and a local (state) script. Here, we shall consider Oriya script, one of the local state languages in India, with Englis...
A Lexicon-Driven Handwritten City-Name Recognition Scheme for Indian Postal Automation
A lexicon-driven segmentation-recognition scheme for Bangla handwritten city-name recognition is proposed for Indian postal automation. In the proposed scheme, the input document is first binarized, and then a slant correction technique is applied to handle the slanted handwriting of different individuals. Next, due to the script characteristics of Bangla, a water reservoir con...
Hand Written Bangla Numerals Recognition for Automated Postal System
Recognition of handwritten Bangla numerals finds numerous applications in postal system automation, passports, document analysis, and even number plate identification. However, practical applications require a high and reliable recognition rate. This paper delineates a robust hybrid system for recognition of handwritten Bangla numerals for the automated postal system, which...
A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation
In this paper, we present a system towards recognition of Bangla pin-code numerals for Indian postal automation. In the proposed system, the broken numerals are first joined using structural features. Next, the numerals are recognized by combining a Neural Network (NN) with a tree classifier based approach. Considering similarly shaped numerals, the NN first classifies the 10 numerals into six groups. N...
Overview of FIRE-2015 Shared Task on Mixed Script Information Retrieval
The Transliterated Search track has been organized for the third year in FIRE-2015. The track had three subtasks. Subtask I was on language labeling of words in code-mixed text fragments; it was conducted for 8 Indian languages: Bangla, Gujarati, Hindi, Kannada, Malayalam, Marathi, Tamil, Telugu, mixed with English. Subtask II was on ad-hoc retrieval of Hindi film lyrics, movie reviews and astr...
Journal title:
- IJPRAI
Volume 23, Issue
Pages -
Publication date 2009